Empirical Evaluation of Tree distances for Parser Evaluation
Abstract
In this empirical study, I compare various tree distance measures, originally developed in computational biology for tree comparison, for the purpose of parser evaluation. I control for the parser setting by comparing the parse trees automatically generated by a state-of-the-art parser (Charniak, 2000) with the gold-standard parse trees. The article describes two tree distance measures (RF and QD) along with their variants (GRF and GQD). It argues that the RF measure captures information similar to that captured by the standard EvalB metric (Sekine and Collins, 1997) and by the tree edit distance (Zhang and Shasha, 1989) as applied by Tsarfaty et al. (2011). Finally, the article provides empirical evidence by reporting high correlations between the different tree distances and the EvalB metric's scores.
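To make the idea of a tree distance for parser evaluation concrete, here is a minimal sketch. It assumes that RF refers to the Robinson-Foulds distance from computational biology, and it uses a simplified rooted, unlabeled variant: each tree is reduced to the set of token spans covered by its internal nodes, and the distance is the number of spans found in one tree but not the other. The nested-tuple tree encoding and the function names `clusters` and `rf_distance` are illustrative assumptions, not the article's implementation.

```python
def clusters(tree):
    """Collect the (start, end) token spans covered by internal nodes.

    A tree is a nested tuple (label, child, child, ...); a leaf is a plain
    string token. This encoding is an assumption made for illustration.
    """
    spans = set()

    def walk(node, start):
        if isinstance(node, str):       # leaf token occupies one position
            return start + 1
        end = start
        for child in node[1:]:          # node[0] is the label, ignored here
            end = walk(child, end)
        if end - start > 1:             # skip trivial single-token spans
            spans.add((start, end))
        return end

    walk(tree, 0)
    return spans


def rf_distance(gold, parsed):
    """Size of the symmetric difference between the two span sets."""
    return len(clusters(gold) ^ clusters(parsed))


gold = ("S", ("NP", "the", "dog"), ("VP", "saw", ("NP", "a", "cat")))
parsed = ("S", ("NP", "the", "dog"), ("VP", ("X", "saw", "a"), "cat"))

print(rf_distance(gold, parsed))  # 2: spans (3, 5) and (2, 4) are unmatched
```

Because node labels are ignored, this sketch compares only bracketing structure, which is one way to see why such a distance could track bracket-based scores like those of EvalB.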
Similar resources
Studying impressive parameters on the performance of Persian probabilistic context free grammar parser
In linguistics, a treebank is a parsed text corpus that annotates syntactic or semantic sentence structure. The exploitation of treebank data has been important ever since the first large-scale treebank, the Penn Treebank, was published. Although treebanks originated in computational linguistics, their value is becoming more widely appreciated in linguistics research as a whole. F...
Grammar & Parser Evaluation in the XTAG Project
In this paper we discuss several methods used to evaluate the XTAG parser and English grammar. We consider the methods proposed in the literature for grammar and parser evaluation, and give some empirical reasons for electing to use certain methods over others. We propose a general framework for evaluation, which is then used to evaluate the English grammar and parser developed as part of the X...
MDA Support for Constraint Checking Framework in EJB
[Figure residue: diagram labels including "run LALR(1) parser", "run attribute evaluator", "Textual Constraints", "Concrete Syntax Tree", "Model constraints loaded", and "[no evaluation errors]".]
Tree Distance in Answer Retrieval and Parser Evaluation
The use of syntactic tree-distance as a surrogate for semantic distance in an answer retrieval task is investigated. The feasibility of this is confirmed by showing that retrieval performance increases with parse quality, and an application of this to parser evaluation is discussed. Variant definitions of tree-distance involving parameters such as whole vs sub-tree, node weighting, wild-card tr...
An Evaluation of Parser Robustness for Ungrammatical Sentences
For many NLP applications that require a parser, the sentences of interest may not be well-formed. If the parser can overlook problems such as grammar mistakes and produce a parse tree that closely resembles the correct analysis for the intended sentence, we say that the parser is robust. This paper compares the performances of eight state-of-the-art dependency parsers on two domains of ungramm...
Journal title:
- CoRR
Volume: abs/1409.0314 (no issue number given)
Pages: -
Publication date: 2014